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Abstract. Considering mean-variance portfolio problems with uncertain model parameters, we 
contrast the classical absolute robust optimization approach with the relative robust approach based 
on a maximum regret function. Although the latter problems are NP-hard in general, we show that 
tractable inner and outer approximations exist in several cases that are of central interest in asset 
management. 
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1. Introduction. We consider decision-making tools for problems that are de- 
fined by uncertain or unknown parameters. Uncertainty and the risk of undesirable 
outcomes are inevitable features of most productive activities. From engineers to 
economists, from health care providers to investment managers, many professionals 
must take decisions under considerable uncertainty on a daily basis. Our objective 
is to improve the quality of decisions made in these environments by understanding, 
modeling, quantifying and managing uncertainty. To achieve this goal we propose a 
robust optimization modeling methodology based on extending the idea of relative 
robust optimization of Kouvelis and Yu jlS^ in the context of discrete optimization. 

Many decision problems with uncertainty can be formulated as optimization prob- 
lems. In recent years, robust optimization (RO) has emerged as a powerful tool for 
managing uncertainty in such optimization problems [H [5] . An excellent overview can 
be gained from the recent survey paper [2], Robust optimization is a generic term 
that is used to describe a class of modeling strategies as well as solution methods 
for optimization problems that are defined by uncertain inputs. Decisions made with 
incomplete information may result in undesirable outcomes when the realized values 
of the uncertain inputs are unfavorable. Robust optimization models and algorithms 
aim to mitigate the effects of uncertainty and obtain a solution that is guaranteed to 
perform reasonably well for all, or at least most, possible realizations of the uncertain 
input parameters. 

There is growing evidence, both empirical and theoretical, that robust-optimized 
solutions have better characteristics than their non-robust counterparts. For example, 
a theoretical study by Schottle and Werner demonstrates that the map between the 
model parameters of an optimization problem and its set of optimal solutions can 
become much smoother if one uses robust optimization with ellipsoidal uncertainty 
sets [19], [20]. In a portfolio optimization setting with uncertain expected return es- 
timates, Ceria and Stubbs report simulated results where the ex-post performance of 
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robust-optimal portfolios outperform those of standard mean-variance optimal port- 
folios with high frequency [6]. 

While the stated goals of robust optimization are intuitive, it is not always clear 
what metrics one should use to achieve these goals. Most robust optimization ap- 
proaches discussed in the existing literature use the "worst-case objective value" as 
the comparison metric among alternative sets of decisions. Despite the advantages we 
mentioned in the previous paragraph, the focus on the worst-case objective value in 
robust optimization is a source of frequent criticism. Modelers, often with good rea- 
son, worry that the extreme scenarios in the uncertainty set have an undue influence 
on the final decisions in such robust formulations. The "worst-case objective value" is 
an absolute metric. While there are many situations where it is the appropriate metric 
for evaluating robustness, it is inadequate for measuring robustness in a relative sense. 

A typical example arises in the investment management context where managers 
are frequently evaluated and compensated based on their performance relative to the 
competition. For robust decision- making in an uncertain decision environment, rather 
than protecting themselves against worst-case scenarios, investment managers may 
thus prefer to choose decisions that avoid falling severely behind their competitors 
under a range of scenarios. This view of robustness was formalized by Kouvelis and 
Yu [13] • For each choice of the decision variables and each scenario one compares the 
attained objective value with the optimal objective value attainable under the model 
parameter values described by the scenario. The difference between these two values, 
or alternatively their ratio, can be seen as measures of regret based on hindsight 
after the true values of the uncertain parameters are revealed. With the objective 
of limiting such regret measures Kouvelis and Yu arrive at the robust deviation and 
relative robust decision problem formulations. We will provide these formulations in 
Section [2] as they form the focus of our study. We will refer to the robust deviation 
and relative robust decision problems collectively as relative robust problems. 

While Kouvelis and Yu explore robust deviation and relative robust decisions in 
several classes of discrete optimization problems, similar studies for continuous opti- 
mization problems are mostly missing in the literature. A rare exception is Taguchi's 
master's thesis [25] and the subsequent paper . As these authors also observed, rel- 
ative robust formulations are typically more difficult than the corresponding absolute 
robust formulations. Since they involve the optimal value function whose argument 
is the vector of uncertain parameters inside a min-max optimization problem, rel- 
ative robust problems are three-level optimization problems. This is in contrast to 
the two-level absolute robust formulations. Since the optimal value function is rarely 
available in closed form, tractability is an important concern for these models. Our 
study shows that for many uncertainty structures on quadratic programming and 
other optimization problems, the resulting relative robust formulation can be reduced 
to one or a series of single-level deterministic optimization problems that can be solved 
using conic optimization methods. 

The simplest uncertainty sets are finite sets, corresponding to the intuitive notion 
of a collection of scenarios. Both the absolute and relative robust formulations with 
finite uncertainty sets are relatively easy as they can be solved as a finite sequence 
of deterministic problems. Using simple convexity arguments, we show that robust 
problems with polytopic uncertainty structures (uncertainty sets defined as convex 
hull of a finite number of points) can be reduced to the finite case and are therefore 
of the same complexity. Relative robust optimization problems with polytopic uncer- 
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tainty sets were also considered by Taguchi et. al. |26| . but our discussion is based on 
the second author's MSc thesis P3] which predated the work of the aforementioned 
authors and formed the basis of the original draft of this paper. Gregory et. al. [TU] 
also investigated the polyhedral case. 

In the second part of the paper we move beyond polytopic uncertainty sets and 
consider relative robust models with ellipsoidal uncertainty. In the application on 
which we concentrate, mean-variance portfolio optimization, we assume ellipsoidal 
uncertainty only in the vector of expected returns and assume that the covariance 
matrix is fixed. This assumption is justified, as practitioners typically use matrix 
shrinkage and factor models to ensure that the covariance is estimated robustly. El- 
lipsoidal uncertainty sets for the vector of expected returns appear quite naturally 
as confidence regions in their statistical estimation. Taguchi et. al. [55] propose to 
approximate ellipsoidal uncertainty sets by a polytope obtained as the convex hull of 
a random sample of points from the ellipsoid. The resulting relative robust problems 
are relaxations of the original relative robust problem with ellipsoidal uncertainty, and 
since these relaxations are based on polyhedral uncertainty, they can be solved via 
the approach discussed earlier. Adorn [T] investigated similar relaxations but based 
on polyhedra generated by pseudo-randomly chosen points, which leads to faster con- 
vergence. 

Our approach to relative robustness under ellipsoidal uncertainty is very different. 
By developing inner approximations to the relative robust problem in the form of a 
symmetric cone programming problem, we obtain solutions that are guaranteed to be 
feasible. In several cases of interest our inner approximations are provably tight, as 
we show using a theory developed by Sturm and Zhang |24) . Thus, while our models 
approximate the feasible set of the relative robust problem from the inside, Taguchi 
et. al.'s approach 2(i approximates it from the outside, which yield solutions that 
are not guaranteed to be feasible but give a bound on the optimal objective value. 
The two approaches can be combined to obtain approximation guarantees for both - 
a distance to optimality in the case of our model, and a distance to feasibility in the 
case of the model of Taguchi et. al.. 

Since we have in mind the mean-variance portfolio optimization framework of 
Markowitz |15] as a particular application of the methods investigated in this paper, 
most of the notation we use will be inspired by this framework which is explained 
in some detail in Section [3] Much of the other notation is of standard use in the 
optimization literature, such as y and >- to denote positive semidefiniteness and 
definiteness of a matrix, for example, cone(-), conv(-) and aff(-) for conic, convex and 
affine hulls of a subset of a vector space respectively, • for the trace inner-product of 
two matrices of equal size, ■* for duals of functionals and cones, e and I for the vector 
of all ones and the identity matrix of appropriate size, and - T for the transpose of a 
matrix. 

2. Absolute Robust versus Relative Robust Optimization. We consider 
a generic optimization problem whose input parameters are denoted by the vector p. 

max/(x,p) (2.1) 

where f{x,p) and X v represent the objective function and the feasible set of the 
problem. For a given p, let z*(p) and fl*(p) denote, respectively, the optimal value 
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and the set of optimal solutions of the above given problem, provided that they exist. 
We will also use the notation x*(p) to denote a generic element of f2*(p). 



2.1. The Absolute Robust Optimization Framework. When p is known 
(12. ip is a standard optimization problem. Robust optimization (RO) is concerned 
with the case where p is not known with certainty. In recent years, RO has emerged 
as an alternative to traditional approaches to optimization under uncertainty such as 
sensitivity analysis and stochastic programming. As mentioned in the introduction, 
its primary objective is to find solutions that will have a good performance under a 
variety of scenarios for the uncertain input parameters. RO models are especially well- 
suited in situations where there are constraints with uncertain parameters that must 
be satisfied regardless of the values of these parameters or when the optimal solutions 
are particularly sensitive to perturbations. Additionally RO is an attractive modeling 
option when the decision-maker cannot afford low-probability high-magnitude risks. 

One of the essential elements of a RO model is the uncertainty set. The uncer- 
tainty set, say tyf , represents the set of possible scenarios/realizations for the param- 
eters p. When p is uncertain and must be estimated, uncertainty sets can represent 
or be formed by difference of opinions, alternative estimates, confidence regions of 
statistical estimators, or based on Bayesian or Kalman filtering methods for tracking 
the evolution of an assumed probability distribution for p. While the current litera- 
ture does not provide clear guidelines on their construction, their shape often reflects 
the sources of uncertainty while their size depends on the desired level of robustness. 
Common types of uncertainty sets include: (i) ^ = {p\,p2, ■ ■ ■ ,Pk} (a finite set of 
scenarios), (ii) ^/ = conv(pi,p 2 , • ■ • ,Pk) ( a polytopic set), (hi) % = {p : I < p < u} 
(intervals), and (iv) ^ = {p : p = po + Mu, \\u\\ < 1} (an ellipsoidal set). 

RO formulations optimize some variation of a worst-case performance metric, 
where the "worst-case" is computed over the uncertainty set. In most cases [6|l8ll9llTT]. 
the objective is to optimize the worst-case realization of the objective function. For 
the optimization problem (|2.ip . this leads to the following formulation: 



Kouvelis and Yu [13] classify (|2.2|) as the absolute robust decision problem. This 
name reflects the fact that the worst-case objective value is an absolute metric. One 
potential consequence of this emphasis on the worst-case is that the decisions are 
disproportionately affected by extreme scenarios in the uncertainty set. As this is not 
always desirable, Bertsimas and Sim [3] study this cost of robustness as a function of 
the level of conservatism. An alternative we consider in this paper is to seek robustness 
in a relative sense. 

2.2. The Relative Robust Optimization Framework. For this purpose, we 
consider a regret function that measures the difference between the performance of 
the solution with and without the benefit of hindsight. If we choose x as decision 
vector when p is the vector of realized parameter values, then the regret associated 
with having chosen x rather than x*(p) as decision vector is defined as follows, 



Note that since x* (p) is an optimal decision vector for the parameter values p, the 
regret r{x,p) is always nonnegative. 




(2.2) 



r(x,p) := z*(p) - f(x,p) = f(x*(p),p) - f(x,p). 



(2.3) 
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The regret function is not useful at the decision-making stage since we cannot 
measure the regret before we observe the realized value of the parameters. Further- 
more, in many a context p cannot be observed even after its realization. For example, 
financial data typically yield a single sample of a random return vector R while the 
parameters p = (E[i?], Cov(i?)) that serve as input parameters to the optimal invest- 
ment problem are neither directly observable nor inferable from this single sample. 
For this reason we consider the maximum regret function instead, which provides an 
upper bound on the true regret, 

i?(x) := maxr(i,p) = max (z*(p) — f(x,p)) . (2-4) 

If the function z*(p) is positive everywhere, one can also consider a scaled version of 
the regret function, 

f{x p) _ i-M-Mri , (2 . 5) 

~ z*(p) — f(x,p) 
R(x) = maxrla;, p) = max . (2.6) 



In Kouvelis and Yu |13j . vectors x that minimize the maximum regret functions 
R(x) and R(x) are called robust deviation decisions and relative robust decisions re- 
spectively. We collectively refer to problems seeking such decisions as relative robust 
problems and focus on the function R(x) for most of the rest of our discussion. 

Let us consider the simpler case where the uncertain parameters are only in the 
objective function and the feasible set 2£ v = JT is independent of p. In most models, 
the dependence of the objective function on the uncertain parameters is linear. When 
this is the case, it is easy to see that the optimal value function z*(p) is a convex 
function. In fact, it is sufficient that / be convex in p to guarantee the convexity of 
z*(p): 

Lemma 2.1. Let ^ be a convex set. For all p £ , define 

z*(p) = sup f{x,p) 

where f is convex in p. Then z* is a convex function on °i/ . 

Lemma [2~T1 is part of the folklore on convex analysis. For the sake of completeness 
we include a proof in Appendix A. As we will see in the next section, this simple 
convexity result is responsible for reducing relative robust optimization models with 
polytopic uncertainty sets to problems with finite uncertainty sets. 

3. Application to Mean- Variance Portfolio Optimization. Portfolio the- 
ory deals with the problem of deciding what proportion of investable wealth to al- 
locate to each of several risky investment opportunities so as to achieve a chosen 
goal, which is usually to maximize the expected return while limiting risk. Under the 
mean- variance optimization (MVO) approach of Markowitz |15j . all n investments are 
assumed to be held during the same fixed investment period over which they generate 
random returns Ri. Assembled in a random vector R, the expectation /i = E[i?] and 
the positive definite covariance matrix Q = Cov(R) of the asset returns serve as in- 
put parameters to one of the quadratic programming problems (|3.1[) - (|3.3[) described 
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below. Their solutions yield optimal portfolio weights. Though these problems are 
convex and computationally tractable and can therefore be solved to global optimal- 
ity, MVO models can produce portfolios that are highly sensitive to the values of the 
model parameters (//, Q) and show unsatisfactory diversification. Since (ji, Q) have 
to be estimated statistically, output sensitivity to these parameters is an important 
practical issue. Robust optimization models have therefore emerged as favourable 
alternatives to plain MVO models [9ll28ll6j. 

3.1. Classical Mean- Variance Portfolio Models. Although the mathemat- 
ical methods investigated in this paper are applicable more widely, the MVO frame- 
work constitutes their main motivation and application. We shall therefore briefly 
describe some of the models that arise in this context. Let Xi be the proportion of 
wealth invested in the i-th investment opportunity (or asset), and let these weights 
be collected in a vector x of size n. The portfolio corresponding to the weights x then 
has the overall return R T x with expectation p^x and variance x T Qx. Apart from 
the budget constraint e T x — 1 (where e :— [1 ... i] T ), fund managers usually restrict 
the set 3£ of feasible portfolios (investment decisions they are willing to consider) 
by introducing further constraints that impose limits on short-selling, diversification, 
rebalancing costs and other criteria. Typically, all constraints are linear, leading to a 
polyhedral feasible set J = {i£ l n : Fx = /, Gx < g}. Here we make the minimal 
assumption that be a convex tractable set, that is, a set for which it can be decided 
in polynomial time whether or not a given point is a member. 

3.1.1. Convex MVO Models. Taking the variance of the portfolio return as 
a risk measure, MVO formulations are obtained by either choosing to minimize the 
variance subject to a lower bound target return p, to maximize the return subject 
to an upper bound target risk a 2 or to maximize the risk-adjusted expected return 
p T x — Xx T Qx defined by a specific choice of a risk-aversion parameter A > 0, 

min /q(x) :— x T Qx (3-1) 

S.t. fl T X > p, 



max f Jx) := p T x (3.2) 
s.t. x T Qx < a 2 , 



max fn,Q(x) :— p7 x — \x T Qx (3-3) 
s.t. x e S£. 

It is well-known and can easily be established using KKT conditions that the three 
formulations presented above are equivalent in the sense that they produce identical 
solutions for appropriately chosen values of p, a 2 and A. For example, there exists 
a function <j 2 (p,Q,r) such that the solutions of (|3.1|l and (|3.2j) coincide when a 2 = 
a 2 (p,Q, p). We note however that this functional dependence also depends on the 
model parameters. A portfolio x is said to be efficient if it optimizes (|3.1I) - (|3.3I) for 
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some choice of p, a 2 and A respectively. Because of the above-mentioned equivalence, it 
does not matter which problem we use for this definition. The set {((x T Qx) 1 / 2 , [i T x) : 
x efficient} is the efficient frontier. 

3.1.2. Sharpe Ratio Maximization. Another variant of MVO is the Sharpe 
Ratio maximization problem. Let r be the return of a risk-free investment held over 
the same period as the above considered assets, e.g., a short-term government bond 
or the money market. If this risk- free asset is included among the considered assets 
- we call it asset and refer to it as cash - and if the feasible set 3£q of portfolios 
containing a cash position is of affine form, 

£- = aff({[°] : -TGJT}U {[J]}), (3.4) 

where aff(-) denotes the affine hull of a set and 3£ is the set of feasible portfolios 
containing only positions in the risky assets, then cash can freely be borrowed to 
invest in the risky assets. In this case the efficient frontier is a straight line going 
through the point (0, r) and with a gradient given by 

max — ; (3.5) 

*6R" y/x T Qx 
S.t. 



The Sharpe ratio [52] of a portfolio x is defined as the ratio of the excess expected 
return of the portfolio over the risk-free asset and the standard deviation of the 
portfolio return. Correspondingly, Model p.5[) is called the Maximum Sharpe Ratio 
problem (MSR). In the case where this problem has a unique optimal solution, this 
solution is called the market portfolio. It can be easily seen that any efficient portfolio 
is then an affine combination of cash and the market portfolio. This observation forms 
the basis of Sharpe's capital asset pricing theory |21) . 

While (|3.5p is a nonlinear and nonconvex problem, it can be solved by the tractable 
convex programming problem 

m ax5(y) = (// - re) T y (3.6) 
s.t. yeM+S", 

y T Qy < l. 

See Appendix B for detailed explanations, and also [H| and [55] for similar techniques. 

3.2. Absolute Robust Portfolio Models. Motivated by the sensitivity of the 
solutions of Problems (|3. II) , (|3. 21) , (|3.3[) and (|3.5I) as functions of the model parameters 
(fi, Q), we next consider robust counterparts of these models. Depending on how the 
uncertainty set ^ for the model parameters (/it, Q) is chosen and which of the models 
one chooses to robustify, one arrives at different robust formulations. We note that 
the ensuing models are no longer all equivalent in the sense in which the nonrobust 
versions (|3 . If) — (|3. 3[) were. The main reason for this difference is that the feasible sets 
of Problems (|3.ip and (|3.2|) depend on the model parameters (p, Q) while those of 
Problems (|3.3[) and (|3.5[) do not, with the consequence that in the context of the 
former two problems joint uncertainty structures in and Q cannot be exploited, 
while in the context of the latter two they can, at least conceptually. 

7 



To be more specific, let us write :— {fi : 3 (/i, Q) £ tyf} for the projection of 
% onto the /^-component, and *%q := {Q : 3(/i,Q) £ <&} for its projection onto the 
Q-component, and note that when /i and Q have joint uncertainty structure, then 
% is usually strictly contained in the Cartesian product ^ M x ^q. Yet the absolute 
robust counterpart of (13.11) has the following equivalent formulations, 

mm I max x T Qx 



<=>• min ( max x 1 Qx I , (3-7) 

<^> mm I max x T Qx 



Thus, the joint uncertainty structure of fj, and Q cannot be exploited in the context of 
the robust problem (I3.7I) . and neither can it be in the context of the absolute robust 
counterpart of (|3.2|) . 

,T, 



max min ax). (3.8) 

{ieaT:max Q( :* Q x T Qx<a 2 } / 

In contrast, joint uncertainty in /i and Q can be exploited, at least conceptually, in 
the framework of the absolute robust counterpart of (|3.3[) . 

max ( min u T x — Xx T Qx ) , (3.9) 
x&sc \(ii,Q)ew J 

as well as in the absolute robust counterpart of (|3.5|) . 

/ t \ 

max min — , . (3.10) 

Special cases of the above-described models appear in the literature as follows: 
Goldfarb and Iyengar [S] discussed the problems (|3.7[) , (|3.8p in the case where ^ is 
an uncertainty set of Cartesian type = ^ X corresponding to a confidence 
region for the statistical estimators arising in the context of the load-factor model for 
(n,Q). Halldorsson and Tutimcu [11] discussed the problem (|3.9[) in the case where 
a i/ = ^ x and where ^ is a box of confidence intervals for the individual 
components of /it and °}/q is a box of confidence intervals intersected with the cone of 
positive semidefinite symmetric matrices. 

Note that the absolute robust problems (|3.1|) - (|3.3|l are all two-level optimization 
problems and thus a priori harder to solve than the classical nonrobust models (|3.1|) - 
(|3.3|) . However, when is tractable, then the robust problems are tractable too. See 
the above cited papers and the other literature on robust optimization for details. 

3.3. Relative Robust Portfolio Models. To contrast the relative robust frame- 
work with the absolute robust setting, we next formulate relative robust counterparts 
of problems f33 ]) -([0 ]) . 

Let us first consider problem (|3.1I) . which has two meaningful relative robust 
analogues: In the first version, 

z*(fi,Q) := min y T Qy, 
{y£.9> :p T y>p} 



is defined as the maximum objective value achievable by an omniscient adversary (one 
who knows the parameters (//, Q) with certitude), leading to the maximum regret 



R(x) := max (x T Qx — min y T Qy) 
(**,Q)e* {y&-T:^y> P } 

and the relative robust problem 

min ( max (x T Qx — min y T Qy)). (3.11) 

In the second version regrets are computed merely relative to the solution of a fortu- 
itous adversary who is bound to choosing a portfolio that is feasible for all parameter 
values in but happens to choose the one that is optimal among these for the true 
parameter values. Thus, one would have to define 

z*{li,Q) := min y T Qy, 

{ye.% : miivg^ v 1 y>p\ 

R(x) := max (x T Qx — min y T Qy), 

which leads to the relative robust problem 

min ( max (x^Qx — min y T Qy))- (3.12) 

{xeX: min^e^ p T x>p} Qe^Q {ye X : min„ e ^ i/ T y>p} 



Note that, similarly to what we observed in the context of Problem (|3.7j) . the de- 
pendence of the feasible set of Problem (|3.1[) on the model parameter /i introduces 
limitations on the exploitation of joint uncertainty in /i and Q. The emergence of two 
conceptually different relative robust analogues is also due to this dependence. 

In complete similarity, Problem (|3.2j) has the following two relative robust ana- 
logues with similar limitations on the exploitation of structured uncertainty sets, 

min ( max ( max LL T (y — x))), (3.13) 

{xGir:max ce ^ Q x T Cx<a 2 } y (p,Q)<EW {ydX :y T Qy>a 2 } ' 

min (max( max /i T (y — a;))) . (3-14) 

Let us now turn our attention to the relative robust counterparts of (I3.3P and 
(13. 5[) . In these cases, the feasible set is independent of the model parameters, and 
for any given uncertainty structure there exists only one relative robust counterpart 
model. 

3.3.1. The Relative Robust Counterpart of Problem (pO]) . We define 

z*((i, Q) := m&x(fi T y - Xy T Qy), 

year 

R(x) :— max (max(^ T y — \y T Qy) — (l^x — x T Qx)). 

(p,Q)GW y£% 

Problem (|3.3[) then has the following relative robust counterpart, 

min ( max (max(/i T y — \y T Qy) — (li x — x T Qx)]). (3.15) 



Introducing an artificial variable 7 that expresses an upper bound on the regret, we 
can equivalently reformulate this problem as follows, 



min 7 (3.16) 

(x,7)eR" + 1 

S.t. X (E r^f" . 

j>z*(fj,,Q)-fj, T x + x T Qx, V(/x,Q) e 

Alternatively, using the definition of z*(p, Q), we obtain another equivalent formula- 
tion, 

min 7 (3.17) 

(x,y,y)m 2n+1 

S.t. X £E j 

7 > M T y - y T Qy - v T x + x T Qx, V (/i, Q)e^,i/e JT. 

We will use the formulation p. 161) in Section 0] while the formulation (|3.17[) will be 
preferable in Section [3] 

3.3.2. The Relative Robust Counterpart of Problem (|3.5p . We now de- 
scribe the relative robust maximum Sharpe-ratio problem and reformulate it using 
the convexification approach described in Appendix B. 

We will use the notation introduced in Section 13.1.21 and Appendix B, and we 
assume that the feasible set 3£q of portfolios containing a cash position takes the 
affine form {33}. 

For any given (/i, Q) £ °i/ the maximum Sharpe Ratio achievable in 3£ is given 

by 

z*(u,Q) := max (M 7 re)T:r (3.18) 

and can be computed by solving a convex problem of the form Q3.6P . 

Furthermore, introducing an artificial variable 7, the relative robust counterpart 
of Problem (|3.5[) can be formulated as follows, 

min 7 

s-t. {ji ~ r ^ X >**(//, Q)- 7, V(,i,Q)e^. 
yx'-Qx 

We see that, in effect, this model correponds to comparing the Sharpe ratio achieved 
by the optimal decisions x* with the Sharpe Ratio achieved by an omniscient ad- 
versary. Although the absolute robust counterpart problem is tractable when the 
uncertainty set is of cartesian form = ^ x ^q, the relative robust counterpart 
problem is not. We therefore restrict ourselves to the case where only /i is uncertain, 
and Q is known with certainty, that is, — ^ x {Q}. This is a realistic assumption, 
as in practical applications it is typically more interesting to model uncertainty in \x 
only and guard against uncertainty in Q via matrix shrinkage techniques. 

Assuming an uncertainty structure of the form = ^ M x {Q} and using the 
convexification approach described in Appendix B, the above relative robust model is 
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equivalent to 



min 7 (3.19) 

s.t. {n - re) T y > z*{n) -7, W fi £ %, 
y G R + X, 

y T Qy < 1. 

Note that since Q is certain, z* can be considered to be a function of n only. We also 
remark that if we had considered the relative robust counterpart of Problem p.6[) , we 
would have arrived at the same model under the chosen uncertainty structure. 

3.3.3. Complexity of Relative Robust Optimization. All relative robust 
problems introduced above are three-level optimization problems. In contrast to the 
two-level absolute robust models (|3.7j) - (|3.9[) . relative robust problems are generally 
intractable, even for tractable uncertainty sets . This is further illustrated in Section 
[5] in the case where ^ is an ellipsoid and °1/q a singleton. The best we can hope 
to achieve in this case is to identify tractable approximations. Most of Section [5] 
is therefore spent on deriving good polynomial-time solvable inner approximations 
to this problem. Outer approximations - that is, relaxations - that rely on the 
tractability results for polytopic uncertainty sets derived in Section 0] were discussed 
by Taguchi et. al. and Adorn [T]. 

4. Finite and Polytopic Uncertainty Sets. In this section we will show that 
if the uncertainty set is chosen as a polytope - that is, the convex hull of k points 
- or a set of k points, then the relative robust optimization problems p. lip . (I3.12[) . 
(I3.13[) , (|3.14p . p,15[) and (|3.19[) are polynomial-time solvable as a function of k and the 
problem dimension. The complexity is also polynomial in the logarithm of a condition 
number [7], as any conic programming problem, but we will not discuss details here. 

4.1. Solving Problem (|3.15|) . We start by considering Problem (|3.15[) in the 
form (|3.16[) and by assuming that the uncertainty set 

4T = {(a*M QM): (i = l,...,fc)} 

consists of finitely many scenarios. For any given \x and positive definite Q, z*([i, Q) is 
easily computed by solving a convex quadratic optimization problem. Therefore, each 
instance of the last inequality in the formulation (j3. 16[) is a convex quadratic constraint 
that can be efficiently handled using, for example, conic optimization methods. Model 
()3.16p can thus be rewritten by enumerating the possibilities, 

min 7 (4-1) 

cc,7 

x G X, 

^ T x- \x T Q^x > **0x w ,Q yi )-7, (i = l,...fc) 

This shows that the relative robust problem (|3.15p can be solved by first obtaining 
the optimal values z* (fjfi, Qw) and then solving problem (|4.ip as a second-order 
cone programming problem (SOCP) with k convex quadratic constraints. While this 
process may be tedious and time consuming, the resulting formulation is a single level 
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deterministic optimization problem that can be solved efficiently for realistic problem 
sizes both in terms of the dimension and the number of parameter scenarios. 

Next, we consider polytopic uncertainty sets, namely those defined as the convex 
hull of a finite number of extreme scenarios, 

= conv({(/x [i] ,Q w ) : i = l,...,k}). (4.2) 

Using Lemma 12.11 we can now observe that the relative robust model (|3 . 1 5[) that 
corresponds to this uncertainty set is also solved by (14.1[) . This follows immediately 
from the following corollary: 

Corollary 4.1. For °i/ given in (|4.2[) and x £ 1", the following are equivalent, 

i) fi T x — Xx T Qx > z*(/i, Q) — 7 for all (/i, Q) e ^ , 
ii) fi^ T x - Xx T Q^x > z*(fj,M,QM) -7 for (i = l,...,fc). 

Proof. We only need to show that ii) implies i), as the reverse implication is 
trivial. For each (/x, Q) e % there exist weights aW > such that a ^ = 1 an d 

(/Lt, Q) — a' 1 ' (/'''iQ' 1 ')- Multiplying each inequality in ii) by ccM and taking the 

sum, one obtains the required inequality 

fe 

fi T x - Xx T Qx > J2 z * 0* W > Q W ) ~ 7 > fa Q) - 7, 

4=1 

where the second inequality follows from the linearity of the function (/x, Q) h4 /x t x — 
\x T Qx and application of Lemma 12. II □ 

Thus, when the uncertainty set is given as a convex hull the relative robust prob- 
lem (|3T5]) is tractable. Similar results hold for Models ([3~TTjl . (|3~T2|) . ([333]) and 
(|3. 14|) . The reader will find it easy to work out the details. 

4.2. Solving Problem (|3.19|) . Recall that in the case of Model d3H3), we 
assumed the uncertainty set to be of the form = ^ x {Q}, that is, we assumed 
the covariance matrix Q to be known with certainty. In the case where ^ is once 
again given by a finite number of extreme scenarios 

% = {^ ] : i = l,...,fc}, 

the k values z(^) need to be computed by solving convex quadratic programming 
problems of the form (|3.6|) , and then (|3.19[) turns into the convex quadratic program- 
ming problem 

min 7 (4.3) 

7SR,ySR" 

s.t. (pfl - refy > z* (jjfi ) - 7, (» = 1, . . . , jfe) 
y E K+JT, 

y T Qy < 1. 

In the case where ^ is a convex hull of extreme scenarios 
%=conv({n^ : i = l,...,k}), 

one can once again exploit the fact that the function fi z*(ji) is convex, so that 
Lemma |2~T1 implies that (|4.3|) solves Problem (|3.19[) . 
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5. Ellipsoidal Uncertainty Sets. In the remaining sections we assume the 
covariance matrix Q to be known with certainty. The vector of expected returns /i is 
assumed to be uncertain and lie in an ellipsoidal uncertainty set 

% = tp+Mu : ||u|| < 1}, 

where Misanxfc matrix with k < n. Although the ellipsoid need not be full- 
dimensional, in applications it is often natural to choose k — n. Throughout this 
section we treat u or /j, = /u(tt) = ~p+ Mu as the vector of uncertain model parameters 
interchangeably. Further, we assume that the set of feasible decision vectors is of the 
form 

X = {x e R" : Fx = /, Gx < g} , 

where F G R m / X " has full row rank, / 6 R m f , G € R m " xn and g € M m f. We write 
Fi and Gi for the i-th rows of F and G respectively, and 

fp(x) := fi T x - \x T Qx, 

z*(fj,) := max/ M (y), 

vex 

R(x) := max z*(u) — f u (x). 

The relative robust problem we wish to solve is then given by 

(RRP) min R(x). 



5.1. Copositivity Cones. We begin by introducing the technical tools that 
make an analysis of (RRP) possible. Most of the notation is adopted from the eluci- 
dating paper of Sturm and Zhang |24] . 

If D is subset of R™ let 



Jf(D) := clo{z= [%] G 



r > 0, r 



ED} 



be its homogenization, where clo(-) denotes the topological closure. Let q : x H> 
x T Ax + 2b T x + c be an arbitrary quadratic polynomial on E™, where A € y n is a 
symmetric n x n matrix, b 6 M n and cGl, and let 



A 



Then 



q(x) 



where we write X • Y = trace(X T y) for the trace inner product of two matrices of 
equal size (this inner product is the polarization of the Frobenius norm). In what 
follows we will refer to ^#(g) as the matrix representation of q(-). This defines a 
1-1 correspondence between the set of quadratic functions on R™ and the set ,^ n+1 
of symmetric matrices of size (n + 1). In the sequel we will write X y if X is a 
symmetric positive semidefinite matrix and 



(n+l) ^ G yn+1 . Xh Q} 
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Let 

&V+{p) := {si G y n+1 : [f] T ^[T] > Vxefl} 
= G J^" +1 : z T £/z > Vz G Jf(D)} 

be the set of quadratic functions that are nonnegative on D. In the case where 
D = {x : q(x) > 0} for some quadratic polynomial q, & e £+{D) is called the set of 
quadratic functions copositive with q. Here we use an abuse of language and speak 
of ^fio + (D) as the copositivity cone associated with D when D is a more general set. 

Lemma 5.1 (Corollary 1, [M]). J^f+(L>) = conv{zz T : z G (£>)}* . 



Proof. Lemma 15.11 is the same as Corollary 1 in Sturm & Zhang 24 . Here we 
give an alternative proof for completeness. We have 

J^f+OD) = {X G y n+1 : z T Xz > 0, Vz G (£>)} 
= p| {X G : (X; zz T ) > 0} 

= (conv{zz T : z G J^(D)})* . 



Unfortunately, for general D, the cone conv{zz T : z G Jf?(D)}* does not have a 
tractable characterization. For example, when D = K!? then 

Jf(D) =clo({[*] : T>0,r 1 i£R" + }) =R^ +1 , 

and 

^+{D) = conv{zz T : z G 

is the co-positive cone. Testing whether a given matrix belongs to this cone is co- 
NP-hard [IS]. We will see in the sequel that (RRP) is equivalent to solving a conic 
optimization problem with a conic constraint of type ^ r( ^'+(D) for a convex set D 
defined by multiple linear and one quadratic inequality. If the cone ^ C £+{D) is 
intractable, then the conic formulation of (RRP) is intractable too. To render the 
relative robust approach computationally viable in this situation, we will identify a 
tractable convex cone 

K C &g+(D) 

which can be used in an inner approximation of (RRP). Most of the technical details 
regarding the construction of K are deferred to Section [5] 

5.2. A Conic Formulation of (RRP). Introducing an artificial variable 7, 
(RRP) is easily seen to be equivalent to 

(RRP.i) min7 

s.t. x G SC 

7>2*(M)-/„(z) V M G^. 
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Since z*((a) := maxyg^r fn(y), this problem can be further rewritten as 



(RRP.ii) min7 

x,7 



s.t. x e 



(5.1) 



This is a semi-infinite optimization problem, that is, a finite-dimensional problem 
with infinitely many constraints. To render this problem amenable to numerical com- 
putations, we have to replace these infinitely many constraints by a finitely many. 

Parameterizing \x by u, the set of values of y) that appear in the right-hand 

side of (I5.ip corresponds to 



D :={[%]: u T u<l,Fy = f, Gy < g} 



(5.2) 



It follows from Lemma 4 of Sturm-Zhang [53] that the homogenization of this set is 
characterized by 



6 R fc4 
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. 9i 
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-10 0" 
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T _ 




-0 1- 




_ T _ 



>o, 



0, (i = 1,.. . ,m/), 



Further, for fixed (x, 7), the expression 
qx, 7 (u, y) := 7 - / P (y) + / M (x) 







-\M \Q 





M T x 


[y] + 


-/■< 



y ] + (7 — Ax T Qx + /u T x) 



is a quadratic function of (u, y) whose matrix representation in the homogenized space 
is given by 



-±A/ T 

l.^T A , f _l 77 T 



Using Lemma 15.11 Condition (I5.1[) is seen to be the same as 

Jt xn G conv{zz T : z <G ^f(D)}* . 
Therefore, (RRP.ii) can be written in conic form, 



(RRP.iii) min7 

X.J 



(5.3) 



S.t. X £ 

jtft xrt <E conv{zz T : z G Jf?(D)}* . 



5.3. A Tractable Inner Approximation. Let be the trailing n x (n — m/) 
block of the orthogonal factor in the QR-decomposition [* ffjij of -F T , so that the 
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columns of H form a basis of ker(F). Further, let x p G R™ be a particular solution of 
the system Fy = /, and let us write r := n — mt, so that 

{.t e R» : Fx = f} = {x p + Hw: we R r }. 



Let Pq = [o 1], where is a zero row vector of size k + r, and let pj (i = 1, . . . , m g ) 
be the row vectors of the matrix [o —gh g-Gx p j^ where is now a zero matrix of 
size m g x k. And finally, let Q = U T U be the Cholesky factorization of Q. With 
this notation, Corollary 16.111 of Section [5] shows that the following problem is an 
inner approximation of (RRP), where the minimisation is over the decision variables 



w G R r , 7, s, rj, &j el, (i ^ j = 0, . . . , m g ), and n E 



Ui G 



% (i = 0,.. . ,m s ): 



(ARRP) min 7 



s.t. g-Gx p - GHw G M^ 3 ' 

J7,£yeR+, (i^j = 0, ...,m a ), 
[2] G Lfe+i, (z = 0,...,m g ), 








1 



1 

V2 







- 1 - 






2 






s 


UH_ 







XH T QH 





■±H T M 



'p 
-1 



G L n+ 2, 



\M T Hw 
H T (XQx p - |7t) 
7 - As + (Jl - 2XQx p ) T Hw 



- & (p*pJ 



G^_ 



fe+r+l 



By construction, every solution (w, 7, s, 77, ^, r, u) to (ARRP) provides a feasible solu- 
tion (x p + Hw, 7) to (RRP.iii). Since the feasible set of (ARRP) is thus smaller than 
the feasible set of (RRP.iii) , a ( ARRP)-optimal solution (w* ,j*,s*,t]*,£*,t*,u*) does 
not necessarily correspond to a (RRP)-optimal is optimal for (ARRP), this does not 
necessarily imply that (x p +Hw* , 7*) is optimal for (RRP.iii). However, since (ARRP) 
is equivalent to (RRP.iii) in the case m g G {0, 1} (see Corollarv lG.lip . it is reasonable 
to expect that (x p + Hw* ,7*) is a feasible solution to (RRP.iii) which is quite close 
to optimal even in the case m g > 2. 

Note that each of the constraints of (ARRP) is formulated as a conic inequality 
of an expression that is linear in the decision variables. Thus, the great advantage 
of working with the model (ARRP) rather than (RRP) is the fact that, while (RRP) 
may be NP-hard, (ARRP) is readily solvable via standard polynomial-time conic 
programming implementations such as SDTP3 [27] or Sedumi [23] , 



6. Tightness of Inner Approximations. In this section we discuss some of 
the technical details and tightness results surrounding the inner approximation of the 
cones ^"^+(.0) used in Section [SJ 
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6.1. The Case of General D. We begin with the discussion of inner approxi- 
mations of ^"^_|_(-D) where D C R™ is an arbitrary set. 

Lemma 6.1. &<€+{D)* C {X £ : lie e J^(D) Vw 6 (£>)*}. 

Proof. By Lemma 15.11 any X £ ^^+(1))* can be written as a limit X = 
linij^oo Xj, where Xj — YliLi^ij z ij z 7j f° r some 6 Jf(D) and £y > (z = 
1, . . . , fcj). Clearly this implies that X >; 0, and since for any w £ J4?(D)* we have 

X j w = Y,^J j w)z ij £^{D) 

i=l 

and Jf(D) is closed, Xw £ Jf(D). □ 

Taking duals in the inclusion of Lemma 16. 1[ we obtain the following inner ap- 
proximation of ^^+(D), 

&D + (D) D y n+1 + K*, (6.1) 

where 

K = {X £ y n+1 : Xw £ Jt?(D) Vw £ J^(D)*}. (6.2) 

To make this result useful, we need to characterize K* . 

Lemma 6.2. Let C C R™ be a closed convex cone and w £ R n . Then 

{X £ y n : Xw £ C}* = {vw T +wv T : v £ C*} . 

Proof. Consider the linear map ip w (X) = Xw from 5^ n to R". Endowing these 
spaces with their canonical inner products (X,Y) := X • Y and (x,y) := x T y, the 
adjoint map ip* w : R™ +1 — > S^ n+1 is defined by the relation 

(X,ip* w (v)) = (y w {X),v), {X £ -y n+ \v £ M" +1 ). 

The right-hand side in this equation equals (Xw) T v = \{X, (wv T + vw T )), showing 
that 

<P* w {v) = \{wv T + vw T ). (6.3) 

Now we have Xw £ C if and only if (ip w (X),v) > for all v £ C* (using biduality 
and the assumption that C is a closed convex cone). Taking adjoints, this is further 
equivalent to (X, ip* w {v)) > for all v £ C* , and finally to 

X £ «(C*))*. (6.4) 

Since C* is a closed convex cone and (p* w is a linear map between finite-dimensional 
vector spaces, tp* w (C*) is a closed convex cone, so that taking duals in (16.41) yields 

{ley" : Xw£C}* =tp* w (C*). 

Using (|6.3[) . this is seen to be equivalent to the claim of the lemma. □ 
Lemma 6.3. Let K be the cone defined in (|6.2j) . Then 

K* = clo (cone{ww T + vw T : v,w £ J^f(D)*}) . 
17 



Proof. For each w £ J^(D)* , let K w := {X £ ,5^ n : Xw £ Jt?(D)}. Since Jf?(D) 
is a closed convex cone, Lemma [6.21 shows that if* = {ww T + wv T : v £ Jf(D)*}. 
Therefore, we have 

n k~y 

= clo(cone( |J K* w )) 

= clo(cone{ww T + vw T : v, w £ J4?(D)*}), 

as claimed. □ 

Example 6.4. Let D = {x £ R n : b T x > c}. Then Jf(D) = {z £ R( n+1 ) : 
a T z > 0}, where a — [_? c ] and Jrff(D)* = cone{a}. Furthermore, we have 

K = {X £ y n+1 : Xa £ Jf{D)} 
= {X £ ,y n+1 : a T Xa > 0} 
= {X £ y n+1 : X»aa T > 0}, 

so that K* = cone{aa T }. This is confirmed by Lemma \6.3[ which says that K* = 
cone{uw T + vw T : v,w £ cone{a}} = cone{aa T }. 

Example 6.5. Let D = {x E M n : ||x|| < 1}. Then 

jr(D) = {z£R n+1 : * T [-i°]*>0, [?] T 2>0}-[°i]L„ +1 , 

where L n+ i is the n+1- dimensional Lorenz cone or second-order cone and the operator 
[" j q 1 permutes the first component of a vector into last place. Since is self-dual, 

we have 

K = {X£.y n+1 : Xw£ [;j]L„+i Vw£ [?J]£„+i}- (6.5) 
Lemma 1 6'. 51 t/ius shows that K* — cone{ww T + iw : v,w £ [ ? g] 

Combining the inclusion (|6.1[) with Lemma 16.31 we arrive at the following result. 
Theorem 6.6. For any set D C E" if zs trwe i/tai 

&<tf + {D) D .Y n+1 + clo(cone{uw T + vw T : v,w £ Jf{D)*}). 



6.2. Approximation Tightness in the General Case. The inner approx- 
imation of Theorem 16.61 is valid for arbitrary D C M n , and when Jf(D) can be 
explicity characterized it becomes a useful computational tool in conjunction with 
Caratheodory's theorem. It is therefore natural to ask if the inclusion given by the 
theorem is in fact an equality. Unfortunately, this is not true in general, as we shall 
now see. 

Example 6.7. Consider Example \6.5\ again, and note that in this case any 
z £ JtffiD) — L n+ \ satisfies 

[-o I S]-" T = * T [-o I ?]*>0. 
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and the same holds true for convex combinations of matrices zz T with z G \ \ q]L„ + i. 
Thus, using this extra information, we could have used the tighter approximation 

^+{D)* c n k nc, 

where K is defined as in (|6.5[) and 

C := {X e ,y n+1 : [" I ?]«^>0}. 

This yields the inner approximation 

,^ + {D) d y^ +1 + K* + C* 

= y™ +1 + cone{wv T + vw T : v, w G [° J ] L n+1 } + cone { [ " 1 ° ] } (6.6) 

which is strictly larger than the inner approximation 

^+{D) D + cone{uw T + vw T : »,w6[»J] L n+1 } (6.7) 

provided by Theorem \6.b\ as the right-hand side of (|6.7[) does not contain the matrix 
["q 1 J], Further, applying the s-Lemma (see Lemma \6.8\ below) in the context of this 
Example, one finds 

^D + (£>)=^7+ 1 +cone{[- 1 ?]}, (6.8) 

an identity which was first discovered by Rendl- Wolkowicz US\/ . In other words, the 
approximation (|6.6I) is not only an improvement over that of Theorem 1 6. 61 but it is 
in fact tight, that is, the inclusion becomes an equality. Note that this also shows that 

{wv T + vw T : v,we [;j]i„+i}c + cone «]} . (6.9) 

The following classical result from the theory of robust control theory was used 
in the analysis of the above example: 

Lemma 6.8 (s-Lemma, Yakubovich [33]). If D = {x e MP : q(x) > 0}, where 
q(-) is a quadratic function that takes a strictly positive value somewhere, then 

&D + (D) = + cone{^f (g)}. 

For proofs see e.g. [IT] and [T!?] . 

6.3. Improved Approximation for Use in Section [5j Next we will general- 
ize the improved inner approximation (|6.6[) to copositivity cones associated with the 
convex set D we found in (|5.2j) . (15. ip . 

D := {[*] G M fe +" : u T u <l,Fy= f, Gy < g) . 

Recall that F G W n f xn has full row rank, and G G R m " xn . Let H be the trailing 
n x (n — m/) block of the Q-factor of the QR-decomposition [* h]R of F T , so that the 
columns of H form a basis of ker(F). Further, let x p G W 1 be a particular solution of 
the system Fy = /, and let us write r := n — mf, so that 



{y G R" : Fy = /} = {x p + Hw : w G M r }. 
19 



(6.10) 



Consider the linear map 



A 








An 


A12 h 






An 


A 12 H 




A 2 2 b 2 


h 


■> 


T HTA J\ 


H T A 22 H 










p\ + x p A 12 


(b 2 + A 2 2X P ) T H 



b\ + Ai 2 x p 

H T (b2+A 2 2X p ) 



For any quadratic function h on M. k+n with matrix representation g/ g yk+n+i ^ /j A 
be the corresponding quadratic function on R fc+r with matrix representation A (.2/). 
If y = x p + Hw, then by construction of A we have 



h(u, y) > <^> h A (u, w) > 0. 
Let q(u,y) = 1 — u T u, so that 

D = {[^}:q(u,y)>0,[oF}[^=f,[oG][ u y }<g}, 

and note that 



(6.11) 



J?{q) 



- 1 







r-i 



where I is an identity matrix of size k and 0„, r are zero matrices of size n and r 
respectively. Let 

Da :={[:]eK l+r : q A (u,w)>0, [0 Gh] [JJ] < 9 - Gx p } . 

We now obtain the following result, which shows that we can work directly in the 
reduced space R k+r instead of R fc+ ™ : 

Proposition 6.9. 

i) D = {[« ( Xp +ma) T ] T : [u w] T e D A } ; 
n) jr? + (D) = A-^^+Pa))- 

Proof, i) follows from (|6 . 10[) and (I6.1ip . while ii) follows from part i) and (|6. 1 1[) . 

□ 

Next, let Pq — [01], where is a zero row vector of size k + r, and let pj 
(i — 1 , . . . , m g ) be the row vectors of the matrix [0 -GH g-Gz p ] , where is now a zero 
matrix of size m g x k. Then it follows from Lemma 4 in Sturm-Zhang [24 that 



Jf(D A ) = { 



r-i 



r 



z>0,pjz>0,(i = 0,...,m g )\. (6.12) 



Theorem 6.10. An inner approximation of the cone ^ C ^'+{D A ) is given by 



&C + (D A ) D y k+r+1 + cone { [ 1 o r ^ ] } 



+ cone {p^J + pjpj : i ^ j e {0, . . . , m g }} 



i=0 



pj : [;]£% . (6.13) 



Furthermore, if m g € {0, 1} then the inclusion in (|6.13p is an equality. 
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Proof. Lemma |5~T1 and (|6 . 1 2[) establish 



z€R k+r+ \ z T 



z>0, pjz >0,(i = 0,.. .,m g )\ 



c s^l +r+1 n K g n K l n • • • n K l m n *Gr, 



(6.14) 



where 



> 



K q := {X £ y k+r+1 : X • 

K\ := {X G y k+r+l : X ft e ^(£> A )} , (» = 0,. . . ,m s ), 
lf> := {X G ^ fe + r +! : G ^(-D A ) Vu G Jf(D A )*} . 

Using the self-duality of Lk+i and (|6.12|) . we get 

Jf(D A )* = cone{ft : i = 0, . . . ,m ff } + { o ; [!]€%}, 
so that Lemma 16.31 implies 

• {PiPj + PjPi ■ i,j G {0, . . . , 

^+[°] [of: [J],[S]Gi*+l 





' u 
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" u ' 
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T _ 




T 



p7 : [;] EL k+1 \. (6.15) 



Next, using Lemma 15721 to take duals in (|6.14[) . we find 
■^C+ (D A ) D yl+ r+1 + cone { [ ~ 1 o r i ] } 

+ E {^ zT + z P J : ze J?(D A )*} + K*, 

8=0 

Substituting (|6. 15|) into (|6 . 16[) . exploiting the fact that 



(6.16) 
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[1},[Z] GL fc+1 



cone ■ 



{r»'j}. 



which follows from (|6.9p . and using 

cone {pipf + pipf : i = 1, 



fc+r+l 
+ ' 



the inclusion claimed in the theorem is seen to hold true. Furthermore, it follows from 
(16. 8|) that the inclusion is an equality when m g = 0. The fact that this is also true 
for m g = 1 follows from Sturm-Zhang [53], Theorem 3. □ 

Corollary 6.11. Let Q = U T U be the Cholesky factorization of Q. Then the 
following are sufficient conditions for (|5.3|) to hold: 3, rj, £y > 0, (i ^ j = 0, . . . , m g ), 
seK and G Lk+i, (i — 0, . . . , m ff ) suc/i i/iai 



i i 
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V2 \/2 
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and 
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Furthermore, for m g G {0,1} the above conditions are both necessary and sufficient 
for (JO) to hold. 

Proof. With ^#a; i7 defined as in Section [SJ condition (|5 ,3|) is of course the same 
as J$xn € ^^+(1?), see Lemma 15.11 Proposition 16.91 shows that this is further 
equivalent to SS^M xrl ") G ^ c io + (D^). By Theorem 16.101 for this latter condition to 
hold it is sufficient to demand that 
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(6.19) 



Further, since A > 0, (|6.19l) is equivalent to the existence of a s > x T Qx such that 
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+ cone {pjpj + p^p? : i ^ j G {0 
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fe+i 



which is the same as (|6.18l) for some r/, > 0, (i ^ j = 0, . . . , m fl ), and [ ] G ii+fej 
(i = 0, ...,m 9 ). Finally, the condition s > x T Qx is easily seen to be equivalent 
to (|6.17p . The last claim holds because by Theorem 16.101 (|6.19[) is equivalent to 
A(^ Xj7 ) e ^V + (D A ) when m g G {0, 1}. □ 
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7. Appendix A: Proof of Lemma 12.11 Proof. Let pi,p 2 G a i/ and a € [0, 1]. 

Let us first assume that z*(api + (1 — a)P2) < +oo. For e > there exists x € S£ 
such that 

z*(apx + (1 - a)p 2 ) -e< f(x, api + (1 - a)p 2 ) 

< af(x,pi) + (1 - a)f(x,p 2 ) 

< az*(pi) + (1 - a)z*(p 2 ). 

Since this is true for all s, we find 

z*(a Pl + (1 - a)p 2 ) < az*{ Pl ) + (1 - a)z*(p 2 ). (7.1) 
Now assume z*(ap\ + (1 — a)p 2 ) = +oo. Then there exists a sequence C X 
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such that 



f(x n ,ap 1 + (l-a)p 2 ) -> +00. (7.2) 
By the same argument as above, we have 

f(x n ,api + (1 - a)p 2 ) < az*(pi) + (1 - a)z*(p 2 ), 



whence (|7.2[) establishes that at least one of z*(pi) (i — 1, 2) equals +00, so that (|7.1|) 
holds once again. □ 

8. Appendix B: Convexification of Problem (13.51) . We use the notation 
introduced in Section 13.1.21 Using the fact that any 1 e J was assumed to satisfy 
the budget constraint e T x, Problem (|3.5[) has an equivalent formulation 

max/(x)= Q* -re) T x 

s.t. 



The objective function f(x) of this formulation is homogeneous of degree in x. 
Consider also the normalized problem, 

max g(y) = (fi- re) T y (8.2) 
s.t. yeK+S", 

y T Qy = 1, 

where R+^T = {tx : t > 0, 1 g J} is a tractable cone. For example, if X is a 
polyhedron, as it is in most applications, K+J?T is a polyhedral cone. 

Since e T x — 1 for all x <E X and Q y 0, we have x T Qx > for all x € 2£ . 
Therefore, any feasible solution x of (|8.1[) provides y = (x T Qx)^ 1 ' 2 feasible 
solution of (|8.2I) . and furthermore, = /(a;). Conversely, since any feasible y of 
(|8.2|) satisfies y ^ 0, the vector a; = (e T y)~ 1 y £ is feasible for (|8.ip and satisfies 
/(x) = 5(y). This shows that (|8.1I) and (I8.2[) are equivalent: Instead of solving (18. 1|) . 
we may solve (18. 2|) and then construct an optimal solution x* — (e T y*)~ 1 y* of the 
first problem from an optimal solution y* of the second. 

Formulation (|8.2[) is furthermore equivalent to its relaxation 

max 5(2/) = G" - re) T y (8.3) 
s.t. yeE+JT, 

y T Qy < 1, 

since a feasible solution of (I8.3[) cannot be optimal unless y T Qy = 1, assuming that 
there exist feasible y for which g(y) > (if this is not the case, it is not rational to 
invest at all). Thus, the optimal solution y* of (|8.2p may be found by solving the 
tractable convex problem (18.3[) . 



21 



